SUBMITTED TO ApJ 

Preprint typeset using I^T^^ style cmulatcapj v. 9/08/03 



GRAVITATIONAL LENSING STATISTICS IN UNIVERSES DOMINATED BY DARK ENERGY 
Michael Kuhlen\ Charles R. Keeton^-', & Piero Madau^ 

submitted to ApJ 

ABSTRACT 

The distribution of image separations in multiply- imaged gravitational lens systems can simultane- 
ously constrain the core structure of dark matter halos and cosmological parameters. We study lens 
statistics in flat, low-density universes with different equations of state w = Pq/pq for the dark energy 
component. The fact that dark energy modifies the distance-redshift relation and the mass function of 
dark matter halos leads to changes in the lensing optical depth as a function of image separation AO. 
Those effects must, however, be distinguished from effects associated with the structure of dark matter 
halos. Baryonic cooling causes galaxy-mass halos to have different central density profiles than group- 
and cluster-mass halos, which causes the distribution of normal arcsecond-scale lenses to differ from 
the distribution of "wide-separation" {A9 > 4") lenses. Fortunately, the various parameters related to 
cosmology and halo structure have very different effects on the overall image separation distribution: 
(1) the abundance of wide-separation lenses is exremely sensitive (by orders of magnitude) to the 
distribution of "concentration" parameters for massive halos modeled with the Navarro-Frenk- White 
profile; (2) the transition between normal and wide-separation lenses depends mainly on the mass scale 
where baryonic cooling ceases to be efficient; and (3) dark energy has effects at all image separation 
scales. While current lens samples cannot usefully constrain all of the parameters, ongoing and future 
imaging surveys should discover hundreds or thousands of lenses and make it possible to disentangle 
the various effects and constrain all of the parameters simultaneously. Incidentally, we mention that 
for the sake of discovering lensed quasars, survey area is more valuable than depth. 
Subject headings: cosmological parameters — cosmology: theory — dark matter — galaxies: halos — 
gravitational lensing 



1. INTRODUCTION 

I Cold dark matter (CDM) theory makes robust predic- 
tions on the number density, spatial distribution, and 
structural properties of dark matter halos that must 
be compared with observational data to test the CDM 
paradigm. Given a large well-defined sample of strong 
gravitational lens systems, the distribution of image sep- 

[ arations Ad is a powerful and direct probe of the halo 
mass function and inner density profiles. This probe 
is attractive for being independent of the uncertainties 
about the relation between mass and luminosity that 

I plague most astrophysical tools. As an example, the 
statistics of wide-separation {A9 > 6") lenses constrain 
the core mass fraction of dark matter halos on group 
and cluster mass scales, which depends on the "concen- 
tration" and slope of the central density cusp (Keeton 
& Madau 2001; also see Flores & Primack 1996), both 
of which are still uncertain and controversial (Navarro, 
Frenk, & White 1997; Moore et al. 1999; Jing & Suto 
2000; Ghigna et al. 2000). On smaller, galaxy mass 
scales (corresponding to A9 of a few arcseconds) , the test 
is complicated by the presence of cooled baryons; when 
baryons cool and condense into a galaxy they modify the 
surrounding dark matter halo (e.g., Blumenthal et al. 
1986). 

The statistics of gravitational lensing are also sensitive 
to cosmological parameters, since these determine the 
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angular diameter distances to the lens and the source, 
and the number density of lens galaxies. Observations of 
distant Type la supernovae (Riess et al. 2001; Perlmut- 
ter et al. 1999), combined with measurements of cosmic 
microwave background (CMB) anisotropies (e.g., Spergel 
et al. 2003; de Bernardis et al. 2002; Pryke et al. 2002; 
Balbi et al. 2000), provide strong evidence that the domi- 
nant component in the universe — the exotic dark energy 
— is not associated with matter, has negative pressure, 
and is causing the cosmic expansion to accelerate. While 
several independent techniques appear to have converged 
rather tightly on a "concordance" model with parameters 
f^tot — 1, — 0.3, h = 0.7, and n = 1, determining the 
equation of state of the dark energy remains one of the 
greatest challenges in cosmology and physics today. This 
may prove difficult with supernovae data alone (Gerke & 
Efstathiou 2002; Efstathiou 1999; Perlmutter, Turner, 
& White 1999), and additional observations (like CMB 
anisotropies, see e.g. Frieman et al. 2002) may be re- 
quired to determine the nature of the negative pressure 
component. 

Gravitational lensing statistics have already been used 
as another probe of the cosmic equation of state (Cooray 
& Huterer 1999; Waga & Friemann 2000; Sarbu, Rusin, 
& Ma 2001; Chae et al. 2002; Huterer & Ma 2003). Dark 
energy modifies the background cosmological line ele- 
ment, which affects the lensing geometry. It also modifies 
the power spectrum of density fluctuations on large scales 
(Ma et al. 1999), the rate of structure growth, the critical 
overdensity for spherical top-hat collapse, and the over- 
density at virialization (Wang & Steinhardt 1998; Wein- 
berg & Kamionkowski 2002), all of which affect the mass 
function and the internal structure of collapsed dark mat- 
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ter halos, with consequences for the lensing cross section 
and the probabihty for muhiple images. Because dark 
energy varies with redshift more slowly than matter, it 
starts contributing significantly to the expansion only rel- 
atively recently, at 2; < 1, where the lensing optical depth 
to distant quasars actually peaks. 

In this paper we explore the ability of lens statistics to 
simultaneously constrain the core structure of dark mat- 
ter halos, including both the concentration and the cool- 
ing mass scale, as well as the halo mass function and the 
cosmic equation of state. While other recent studies have 
examined various aspects of the problem, we consider all 
of the effects simultaneously to examine whether lensing 
can really distinguish between them, and whether lensed 
quasars can effectively be used to draw inferences on the 
background cosmology. The outline of the paper is as fol- 
lows. We first present the ingredients of our calculations: 
the formalism for lens statistics (§2), and a description 
of structure formation under the influence of dark energy 
(§3). We then study how the various parameters in our 
model affect the results (§4). Next, we show that cur- 
rent lens data can test the model and constrain some of 
the parameters (§5). Finally, we argue that ongoing and 
future surveys should dramatically increase the samples 
of known lenses and provide powerful constraints on the 
parameters relating to dark energy and to the core struc- 
ture of dark matter halos (§6). We offer our conclusions 
in §7. 

2. LENSING METHODS 

In this section we present the formalism for lens statis- 
tics calculations. In §2.1 wc discuss the calculation of var- 
ious lensing probability distributions. In §2.2 we describe 
our model for the internal structure of dark matter ha- 
los, in which halos corresponding to normal galaxies are 
treated as singular isothermal spheres (SIS), while lower 
and higher mass halos are assumed to have the Navarro- 
Frenk- White (1997, hereafter NFW) profile. In §§2.3-2.4 
we review the lensing properties of SIS and NFW halos. 

2.1. Probability calculations 

We follow standard methods for computing lensing 
statistics (e.g., Narayan & White 1988; Kochanek 1995; 
Porciani fc Madau 2000; Keeton & Madau 2001; Li & Os- 
triker 2002). Assume that the geometry of the imiverse 
is well approximated on large scales by the Friedmann- 
Robertson- Walker metric. Consider a population of ob- 
jects that can act as deflectors, which lie at redshift zi, 
have mass M, and may be characterized by a set of ad- 
ditional parameters p (such as a "concentration" param- 
eter as defined in §2.4). Let dn/dM dphe the differential 
conioving number density of deflectors with the speci- 
fied properties. The differential probability that a point 
source at redshift Zg with flux S is lensed into multiple 
images by the deflector population is then (e.g.. Turner, 
Ostriker,' & Gott 1984) 



dP .„cdt dn 
- = {l + zif 



dzdMdp dz dM dp 

xaL{zs,zi,M,p)B{S;Zs,zi,M,p) 

where 

cdt c 



dz (1 + z)H{z) 



(1) 
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is the cosmological line element, where H = a/a is the 
Hubble parameter and a = {1 + z)^^ is the scale fac- 
tor. In eq. (1), aL{zs, Zi, M,p) is the cross section for 
multiple imaging, and B{S; Zg, zi, M,p) is the "magni- 
fication bias" factor representing the fact that magni- 
fication causes lensed objects to be over-represented in 
flux- limited surveys. 

The total lensing probability P{zs) can be computed 
by integrating eq. (1) over redshift, mass, and the addi- 
tional deflector parameters p. The distribution of lensing 
observables, such as the separation between the images, 
can be determined by inserting a selection function in the 
integral. For example, the probability of lensing with an 
image separation greater than is 



P{>M;zs) 



dP 



(3) 

where Ti.{A6] Zs, zi, M,p) is unity if the parameters cor- 
respond to a lens with image separation greater than A6', 
and zero otherwise. The probability of lensing with par- 
ticular values of the time delay or the lens redshift can 
be computed in the analogous way. 

We have computed the integrals with direct integra- 
tion, and also with Monte Carlo methods where we sum 
the integrand for random values of the integration vari- 
ables. The advantage of the Monte Carlo approach is 
the ability to compute the distributions of image separa- 
tions, time delays, and lens redshifts simultaneously, but 
the disadvantage is the need for large-number realiza- 
tions to reduce the statistical fluctuations in the results. 
We have verified that the two approaches produce equiv- 
alent results. 

2.2. Overview of the model 

The lensing cross section and magnification bias in 
eq. (1) depend on the radial density profile of the de- 
flector. Two standard models for the density profile are 
the singular isothermal sphere (SIS) and the Navarro- 
Frenk-White (1997, hereafter NFW) profile. We can- 
not simply choose one or the other, however. Assum- 
ing that all massive objects have the same profile (be 
it SIS or NFW) produces a distribution of lensed image 
separations that is grossly inconsistent with the data at 
Ae ~ 1"-10" (Flores & Primack 1996; Keeton 1998; Por- 
ciani & Madau 2000; Li & Ostriker 2002). The simplest 
model that agrees with the data has all objects below 
some mass Mcooi assumed to be SIS, and all halos above 
Mcooi assumed to be NFW. The transition from SIS to 
NFW is not ad hoc, but rather motivated by baryonic 
cooling. In large halos the baryons have not had time 
to cool so the systems retain their initial NFW form; 
while in small halos the gas has been able to cool, sink 
to the center of the halo, and create a more concentrated 
profile that can be approximated as SIS (Kochanek & 
White 2001). The lens data require a mass threshold 
Mcoo\ ^ 10"'^^ h^^ Mq, which is consistent with the halo 
mass whose cooling time equals the age of the universe, 
and also represents the mass scale of the transition from 
galaxies to groups and clusters. 

Recently Li & Ostriker (2002) and Ma (2003) pointed 
out that similar arguments also apply on the small- 
separation and low-mass end: there must be a transition 
from SIS back to NFW (or a related form) as the mass 
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is decreased from normal galaxies 10^^ ^^0) down to 
dwarfs. This transition reflects the possibility that feed- 
back effects or reionization could lead to a suppression 
of baryonic collapse in low- mass halos (Springel & Hern- 
quist 2003; Bullock, Kravtsov, & Weinberg 2002) 

We adopt the two-transition-mass approach, where ha- 
los below mass Md ("dwarfs") have NFW profiles, halos 
between Md and Mc have SIS profiles, and halos above 
Mc ("clusters") have NFW profiles. Some studies (e.g., 
Li & Ostriker 2002) have used generalized NFW profiles 
(Zhao 1996), where the central cusp is steeper than for 
NFW, for dwarfs. We always use the NFW form for 
simplicity, but our results should apply to models with 
steeper cusps as long as they have the same value of the 
core mass fraction (Keeton & Madau 2001). We consider 
only spherical halos because they are adequate for pre- 
dicting the number of lenses (e.g., Keeton, Kochanek, & 
Seljak 1997; Chae 2002). 

2.3. SIS lens 

There is now strong evidence that the mass distribu- 
tions of early-type galaxies within several optical radii 
are well described by the SIS profile; the evidence comes 
from lensing (Cohn et al. 2001; Rusin, Kochanek, & Kee- 
ton 2003), X-rays (Fabbiano 1989), stellar dynamics (Rix 
et al. 1997), and even joint lensing and stellar dynami- 
cal analyses (Koopmans & Treu 2003; Treu & Koopmans 
2003). An SIS halo has density profile p = a^/{27rGr'^), 
where cr„ is the velocity dispersion, which is related to 
the total mass M within the virial radius by 



a,{M,z) = -H^ron]i'Al(^{l + z) 



a/2 



(4) 



Here ro = {SM/iTrpoY^^ is the comoving radius of the 
collapsing perturbation, z denotes the virialization epoch 
of the halo, and Avir(2:) is the virial overdensity of the 
halo (see §3.6). SIS models are fully described by the 
mass and redshift, so there are no additional parameters 
p and the factor dn/dM dp in cq. (1) is just the mass 
function dn/dM. 
An SIS lens has an angular Einstein radius 



9, = 4.pV§l, 
V c / £»„., 



(5) 



where Dig and Dqs are angular diameter distances from 
the lens to the source and observer to the source, re- 
spectively. The image separation is A9 = 29e, and the 
lensing cross section is crgjg = 'jt{9eDoi)^ where Doi is 
the angular diameter distance from the observer to the 
lens. The magnification bias is 



dyy 



N{>S/p{y)) 
N{>S) 



(6) 



where N{> S) is the number of sources with flux greater 
than S. The magnification p{y) can be the magnification 
of either of the two images {p± = Oe/v ± 1) or the total 
magnification {ptot ~ '^Qe/v), depending on the details 
of a lens survey. For example, when studying the CLASS 
survey (§5) we use the total magnification because the 
targets in the initial catalog were unresolved. Combin- 
ing this with a source luminosity function modeled as a 
power law dN/ dS cx S'^ would yield a magnification bias 
oiB = 2-'^/(3 + u). 



2.4. NFW lens 

High-resolution A''-body simulations of the formation 

of dark matter halos consistently produce halo density 
profiles shallower that SIS in the core (NFW; Moore et al. 
1999). NFW argue that the halos form a two-parameter 
family described by the density profile 

^^"■^ ^ {r/rs){l + r/rsf ' 

where ps is a characteristic density and is a scale ra- 
dius. In many cases it is more convenient to take as the 
two free parameters the virial mass M and a "concen- 
tration" parameter c = r^ir/rs- Note that some authors 
define the concentration as c = r2oo /fg , where r2oo is the 
radius at which the mean density of the halo is equal to 
200 times the critical density (NFW; Li & Ostriker 2002). 
This definition is independent of cosmology, whereas the 
definition in terms of the virial radius takes into account 
differences in the virial overdensity for different cosmolo- 
gies (see §3.6) 

The two parameters are not actually independent; Bul- 
lock et al. (2001) found that the concentration parameter 
follows a log-normal distribution where the median de- 
pends on the halo mass and redshift, 

c„.ed(M,z) = -^f^V, (8) 
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where M^ is the mass of a typical halo collapsing today 
(see §3.4). Thus, for NFW halos we can write the factor 
dn/dM dp in eq. (1) as 



dn 



dMdlogc 



dn 
dM 



27rc7iog, 



exp 



(logC- logdned)^ 



2<c 



(9) 

The halos in the ACDM simulations by Bullock et al. 
were best described by a = —0.13 and = 9.0, with a 
scatter around the median of diogc = 0.14 dex. (Note 
that Bullock et al. (2001) incorrectly quote aiogc = 
0.18 dex. This error has been corrected in Wechsler et al. 
(2002).) We find that at large separations the abimdance 
of lenses is very sensitive to both and aiogc, so we treat 
both of them as model parameters. Other models for the 
concentration distribution have been discussed, but they 
generally agree out to 2; ~ 1 (Eke, Navarro, & Steinmetz 
2001; Wechsler et al. 2002; Zhao et al. 2003a, 2003b). 

Bartelmann (1996) derives the lensing properties of 
NFW halos, and Oguri et al. (2002) give useful analytic 
approximations for the image separation and magnifica- 
tion. The lens equation has the form 



y 



Ps 



(10) 



where x is the magnitude of the position vector in the lens 
plane, scaled by r^, and y the magnitude of the position 
vector in the source plane, scaled by VgDoi/ Dqs- The 
lensing efficiency is parameterized by ps = ^PsTs/^c^, 
where 

c2 D„., 



47rG DoiDu ^^^^ 

is the critical surface mass density for lensing (Turner et 
al. 1984). Lastly, 

S(x'), 



9{x) 



Jo 



-x'dx' , 



(12) 
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which for the NFW profile evaluates to: 



In^ 
2 



VT- 
1, 



f arctanhVl — x^. 
j= arctan Va;^ — 1, 



< a; < 1, 
X = 1, 
x>l. 



(13) 

The Einstein radius xe of the lens is the root of 

y{x). The "radial critical radius" Xcr is the root of 
dy/dx. It maps to the radial caustic T/cr = |j/(a^cr)|- Any 
source with \y\ < yet is strongly Icnscd into three im- 
ages with |a;i| > \x2\ > \xz\- The image separation is 
= (|xi| + \x2\)rs/ Doi, so it depends on the source 
position \y\. However, Oguri et al. (2002) and Li & Os- 
triker (2002) argue that the separation is well approxi- 
mated by /S.9 K, 2xErs/ Doi- Wc find that our results arc 
indistinguishable when we use this approximation and 
when we use the exact image separation found by explic- 
itly solving the lens equation. The lensing cross section 



IS a 



NFW 



TT (yore's) ) and the magnification bias is 



B{S) 



_ 2 n 

Vcr Jo 



dyy 



N{>S/fi{y)) 
N{>S) 



(14) 



where N{> S) is the number of sources with flux greater 
than S*, and /i(2/) is the magnification of a source at im- 
pact parameter y. As in §2.3, the magnification may be 
that of either image or the total magnification, depend- 
ing on the details of a lens survey. The magnification 
must be found by explicitly solving the lens equation 
and then using the fact that an image at position x has 
magnification fi = [y{x)/x]~^[dy /dx]~^ . 

3. STRUCTURE FORMATION IN QCDM 

In this section wc summarize the main features of 
structure formation in a universe filled with dark mat- 
ter and dark energy. For calculations of lens statistics, 
the quantities we need are the geometry of the universe 
(§3.1), the mass function of halos (§3.5) and the virial 
overdensity (§3.6). 

While the nature of the dark energy remains unknown, 
a cosmological constant A or a dynamical scalar field 
called "quintessence" (Caldwell, Dave, and Steinhardt 
1998; Peebles & Ratra 1988) are the most discussed 
candidates. The effective equation of state of the dark 
energy is usually parameterized as w — Pq/pQi where 
w = —1 for a pure cosmological constant but may be 
larger for a scalar field component or smaller for phan- 
tom energy (Caldwell et al. 2003); w may be constant 
or time-varying. If w is assumed to be constant in 
time, the combined data sets from Type la supernovae 
(SNe) and the CMB or large-scale structure already im- 
ply — 1.30 < w < —0.6 (Spergcl et al. 2003; Baccigalupi 
et al. 2002; Wang et al. 2000). Our calculations apply 
to models with a constant or slowly varying equation of 
state (Steinhardt, Wang, & Zlatcv 1999). Wc generically 
refer to dark energy as a "Q-component" , and to cosmo- 
logical models with a mixture of dark energy and cold 
dark matter (CDM) as QCDM. ACDM is then a special 
case of QCDM with w = - I. 

3.1. Cosmography 

In the limit of a nearly homogeneous Q-component, all 
the consequences of the Q-field follow from its effect on 



the expansion rate: 



= nM{l + zY + ilQ{l + z) 



3(1-1-11)) 



(15) 

where Hq is the Hubble parameter today, fijvf and fig are 

the matter and dark energy density parameters today, 
and Q.K = 1 — ^ is the contribution of curvature. 
In an il/f = flat universe, the angular diameter distance 
from an object at redshift z\ to an object at redshift 22 
is given by 

r dz 
DA{z„Z2) = r^ (16) 

J Z-l 



I + Z2 



H{z) 



3.2. Linear growth of perturbations 

In flrst-order perturbation theory, the growing-mode 
solution of the matter density contrast satisfies the dif- 
ferential equation 



iD + 2H{z)b = -Hl^Mil + zfD , 



(17) 



where D is the linear growth factor and the dots de- 
note derivatives with respect to time. An analytical so- 
lution in terms of hypergeometric functions can be found 
in Padmanabhan (2003). A good approximation to the 
instantaneous growth index / = d\n.D/d\na as the mat- 
ter density parameter Q.m{z) = f2M(l + z)^{Hq/H^) ap- 
proaches unity is given by 



/ w Om(^)^ 



PAliz) 



Pm{z)+ Pq{z) 



(18) 



where pQ and pM are the Q-field and matter densi- 
ties, respectively (Silveira & Waga 1994; Wang & Stein- 
hardt 1998). We see that the growth of density pertur- 
bations is slowed as pq approaches pM- Growth suppres- 
sion increases with increasing w since the onset of dark 
energy domination occurs at higher redshift: in order 
to achieve the level of mass fluctuations observed today, 
cosmic structure in QCDM must form at earlier times 
(Huterer & Turner 2001). 

3.3. Power spectrum 

Dark halos form from primordial matter density fluc- 
tuations in the early universe. The Q-field is so light 
that on scales less than a few hundred Mpc fluctuations 
in Q disperse relativistically, and quintessence behaves 
as a smooth component like the cosmological constant. 
On very large scales, however, the dark energy may clus- 
ter gravitationally. Fluctuations in the Q-field add to 
the right-hand side of of equation (17), resulting in a 
different growth rate once quintessence starts to dom- 
inate the cosmological energy density. This change of 
behavior is incorporated in the transfer function for the 
matter density field, which accounts for all modifications 
of the primordial power-law spectrum due to the effects 
of pressure and dissipative processes, together with the 
gravitational clustering of matter and dark energy. The 
linear power spectrum for matter density perturbations 
in QCDM scenarios can be expressed as 



P{k,z;w) = Ak''T'^{k,z;w) 



D{z;w) 



Dn 



(19) 
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where A is an overall normalization, k is the wavcnumber, 
n is the spectral index of the post-inflationary adiabatic 
density perturbation, Tq(A:, z; w) is the transfer function, 
and D is the growth factor from equation (17). 

Ma et al. (1999) provide a prescription in which the 
modification due to quintessence with «; > — 1 are cap- 
tured by a multiplicative factor TQ\{k,z) to be ap- 
plied to the standard ACDM transfer function TA{k), 
i.e. Tq = TqaTa. (The ACDM transfer function can 
be taken from, e.g., Eisenstein & Hu 1999.) In contrast 
to models with no dark energy (Jig — 0) or ones with 
a nonzero cosmological constant, (fig ^ 0, w — —1), 
a Q-field with Qq > 0, w > —I adds power to P{k) at 
small wavenumbers, increasingly so as the contribution of 
quintessence to the energy density becomes larger. This 
effect introduces an additional dependence of the trans- 
fer function on redshift (Ma et al. 1999). At this time no 
such prescription for w < —1 phantom energy type mod- 
els has been published. Fortunately the modifications 
to the ACDM transfer function are non-negligible only 
on scales comparable to the horizon. On the sub-cluster 
scales relevant for our calculations it is thus possible to 
ignore these effects. 

3.4. Mass variance 

On the scale of galaxy clusters and below, the QCDM 
power spectrum has identical shape to the ACDM one 
and differs only in the overall amplitude. To fix the nor- 
malization factor A in equation (19), we compute the 
variance of the mass-density field smoothed on a scale of 
comoving radius R, 

al, = {{SM/Mf) = ^ / dk k^P(k, z)W\kR) , 

(20) 

where M = H^UmR'^ f^G is the mass inside a sphere 
of radius R, and W{kR) is the Fourier transform 
of the spherical top- hat window function, VF(a;) = 
(3/x^)(sinx/a; — cosx). For calculations of lensing by 
halos on the scale of clusters and smaller, it seems most 
appopriate to normalize the QCDM power spectrum by 
the local abundance of galaxy clusters.'* Although the 
normalization of a QCDM power spectrum by cluster 
abundances has been discussed by Wang & Steinhardt 
(1998), we present it in the Appendix since we use the 
more recent data set of Reiprich & B6hringc;r (2002). 
We find that the data favor a value of the rms mass 
fluctuations 8 h^^ Mpc spheres, (Ts = ""(^ = 0, i? = 
8h~^Mpc) = 0.74, for the currently favored cosmology 
with f^M = 0.3, fig = 0.7, h = 0.7, and n = 1, indepen- 
dently of w for —1.5<w< —0.5 (see the Appendix for 
details). Then the normalization factor in eq. (19) is 



A = 



2^V| 



J^dk fc"+2T2(fc,0)VK2(fci?8) 



(21) 



Following Bullock et al. (2001) we define as the 
mass of halos collapsing today from l-cr fluctuations, i.e. 
as the solution of ctm = 5c, where 5c is the linearly ex- 
trapolated overdensity at collapse (see §3.6). Table 1 

* By contrast, Sarbu ct al. (2001) compute lens statistics using 
a QCDM power spectrum normalized by the COBE measurement 
of the cosmic microwave background, corresponding to scales near 
the horizon scale today. 



Table 1. M,(«)) [lO^^ Mq] 
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shows M* for various values of w, for our favored value 
of cTs = 0.74 and also for ug = 0.90. 

3.5. Halo mass function 

The mass function of collapsed objects has been de- 
rived from theoretical arguments and from A'^-body sim- 
ulations. The classic prediction comes from Press & 
Schechter (1974), and it has been adjusted by Sheth 
& Tormen (1999) to better fit results from simulations. 
Jenkins et al. (2001) have presented an alternate fitting 
form that they claim fits a wide variety of A^-body sim- 
ulations. The functional forms of these three models 
are: 



nps(M): 



nsT(M): 



2 pb 5c ddM 



Mi 



'M 



dM 



exp 



0.383 Pb 5c da 



M 



a/tF M alj dM 



X exp 



0.707 



1 + 



(22a) 



0.707(5? 



0.3' 



A. 



(22b) 



nj(M) = 



Pq daM 0.315 
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exp [-|0.61 - ln(TM|^'1j2c) 



We adopt the Jenkins et al. mass function as our fidu- 
cial model, but we explore the effects of using the other 
approximations as well. While the original analysis by 

Jenkins et al. did not include any cases with w ^ — 1, 
recent results from A^-body simulations including the ef- 
fect of a dark energy component with w ^ —1 confirm the 
validity of the fitting function (Linder & Jenkins 2003; 
Kuhlen et al. , in preparation). 

3.6. Spherical top-hat collapse 

The PS and ST mass functions and the definition 

of depend on the linearly extrapolated overden- 
sity at collapse 5c- The SIS and NFW halo lensing 
cross sections depend on the virial overdensity Avir = 
{p/PM)vii- Both 5c and Ayir are determined by solving 
the spherical collapse model (e.g., Wang & Steinhardt 
1998; Strigari 2003, private communication). While 
the changes in 5c with w are only a few percent, the 
changes in Ayir are more noticeable. As w increases 
Avir becomes larger as structures form earlier. At 
2 = we find Avir = (499, 409, 346, 298, 270) for w = 
(-0.50,-0.75,-1.00,-1.25,-1.50) (Strigari 2003, pri- 
vate communication). For w > —1 fitting formulae 
for both quantities have been published by Weinberg & 
Kamionkowski (2002). 

4. UNDERSTANDING THE PARAMETERS 
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Table 2. Fiducial Model and Variations : 

4 7 

o 

o 

Parameter Fiducial Value Variation Values z ^ ^ 



Q.m 0.3 - ■& ; 

Q.q 0.7 - ^ r - 

w -1.00 -0.50,-0.75,-1.25,-1.50 ot , J 

n 1.0 - 12 3 

h (1.7 - zs 



dn/dM Jenkins PS, ST 

c* 9.0 6.5,12.4 

criogc 0.14 0.21,0.07,0.00 

logMd 0.0 12.0 

logMc 13.50 13.25,13.75 

a.. 0.0 0.2,0.4,0.6, VLBI 



From the previous two sections it is clear that there 
are a number of parameters that enter into lens statis- 
tics calculations. In this section wc; study how they affect 
the results. Our goal is not to exhaustively catalog and 
quantify all parameter dependences and covariances, but 
to identify the main qualitative features. To do this, we 
adopt a fiducial model (which shall be justified in §5), 
and then vary the parameters one at a time. The fidu- 
cial model and the variations are summarized in Table 2. 
Figure 1 shows the optical depth as a function of source 
redshift, the image separation distribution, and the time 
delay distribution for the fiducial model. Figures 2-4 
then show how the parameter variations affect the image 
separation distribution, the time delay distribution, and 
the lens redshift distribution. 

Note that the image separation distributions are nor- 
malized so the total probability reflects the probability 
of lensing, while the time delay and lens redshift distri- 
butions are normalized so the total probability is unity. 
This allows us to separate changes in the overall lens- 
ing probability from changes in the shapes of the time 
delay and lens redshift distributions. Also note that in 
Figures 2 and 3 wc plot the cumulative separation and 
time delay distributions, but for completeness in Fig- 
ure 1 we show the differential distributions. Finally, 
note that the calculations in this section are designed 
for comparison with the CLASS survey: we use a source 
redshift distribution with mean (z,) = 1.27, and wc com- 
pute magnification bias using the total magnification and 
a source luminosity function modeled as a power law 
dN/dS oc 5-21 (-ggg j^ygjjj ^ Tegmark 2001; Browne 
et al. 2002). 

4.1. The fiducial model 

In our fiducial model, the image separation distribution 

peaks around Ad ~ l"-2", and has a long tail to small 
separations (see Fig. 1). Above the peak the distribution 
drops quickly, then has a kink at A9 ~ 4" with a long 
but low-amplitude tail to large separations. This feature 
represents the transition from normal lenses produced by 
galaxies modeled as SIS halos, to wide-separation lenses 
produced by groups and clusters modeled as NFW ha- 
los. NFW halos arc much less efficient lenses than SIS, 
so wide-separation lenses are much less abundant than 
normal arcsecond-scale lenses. 

The time delay distribution is surprisingly broad, span- 
ning more than three orders of magnitude (also see Oguri 
et al. 2002). Thus, in future surveys a substantial frac- 




-1.0 .0.5 0.0 0.5 1 2 3 

log AB [arcsec] log At [days] 

Fig. 1. — Lens statistics results for the fiducial model. (Top) 
The total lensing probability as a function of the source redshift. 
(Bottom) The image separation and time delay distributions. 



tion of lenses will have time delays on week or month 
time scales for which measurements will be feasible, but 
a non-negligible fraction of lenses will have time delays 
that are too long to be of practical use. There is actually 
a kink in the time delay distribution marking the transi- 
tion from normal to wide-separation lenses, but it occurs 
at ~1100 days and is not visible in Figure 1. 

4.2. Variation in w 

As discussed in §3, the fact that dark energy mod- 
ifies cosmological dynamics is taken into account self- 
consistently in everything except for the effect on the 
NFW concentration parameter. Since overdensities col- 
lapse earlier as w increases, the median concentration 
should increase with w. Some analytical and numerical 
studies of halo concentrations in universes with w —1 
have been made (Bartelmann, Perrotta, & Baccigalupi 
2003; Klypin et al. 2003; Kuhlen et al. , in prepara- 
tion), but so far no c^{w) relations have been published. 
The explicit w dependence of c* (w) appears to be rather 
small, certainly less than the scatter a\ogc, so we neglect 
it in this work. 

Figure 2 shows that changing w affects the full range 
of image separations. The effects arc largest at wide sep- 
arations {A9 > 4"), with more than twice as many lenses 
for w = —0.50 as for ACDM . This makes sense, because 
wide-separation lenses are produced by massive group 
and cluster halos, which are more abundant at redshifts 
zi ^ 0.2-1 in models with larger w by virtue of the earlier 
structure formation. Increasing w slightly decreases the 
number of lenses with A9 < 2"; among the parameters 
we examine, this is the main change at arcsecond scales. 
An increase in w also shifts the time delay distribution to 
slightly higher values (Fig. 3), and shifts the lens redshift 
distribution to slightly lower values (Fig. 4). 

We conclude that wide-separation lenses can be used 
to place useful constraints on w only after parameters 
governing the abundance and structure of massive halos 
((78,5,^, (Tiogci Mf.) are much better constrained (cf. Sarbu 
et al. 2001). Normal arcsecond-scale lenses can provide 
more robust constraints on w, but large future samples 
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Fig. 2. — The eflfects of parameter variations on the cumulative distribution of image separations. 



will be needed to detect the relatively weak effects of w. 
In this regard our results differ from those of Huterer & 
Ma (2003), who claim an upper limit w < -0.85 at 68% 
confidence using the current CLASS lens sample. 

4.3. Variation in the mass function and as 

The strongest effect in this group comes from changes 
in as- In order to match the observed abundance of 
clusters from HIFLUGCS we have adopted a low value 
as = 0.74. Determinations of as at much larger scales 
(COBE, WMAP, weak lensing, excess power in CMB 
at I > 1000, etc.; Ma et al. 1999; Spergel et al. 2003; 
Huterer & White 2002; Komatsu & Seljak 2002), by con- 
trast, tend to favor a higher value ag ^ 0.90. At present 
there exists no satisfying reconciliation of this tension, 
so we are forced to treat as as an unknown. The seem- 
ingly small increase in erg from 0.74 to 0.90 produces a 
change of about an order of magnitude in the abundance 
of wide-separation lenses. The total lensing probability, 
however, is affected by only 10%. The increased abun- 
dance of wide-separation lenses, which have long time de- 
lays, shifts the time delay distribution to slightly higher 
values. 

The use of different mass hmctions. Press Schechter 
vs. Sheth-Tormen vs. Jenkins, produces a smaller effect. 
The PS mass function tends to over- (under-) predict 
the abundance of halos below (above) ^ 10"'^^ Mq rela- 
tive to TV-body simulations, while the ST and Jenkins 
mass functions both aim to correct this problem. Thus 



it is not surprising to see that the PS mass function tends 
to predict more small-separation lenses and fewer large- 
separation lenses than the ST/ J mass functions. Rela- 
tive to the Jenkins mass function, the ST mass function 
produces slightly fewer lenses at all separations, but the 
differences arc less than 10%. The tendency for the PS 
and ST to shift the image separation distribution to lower 
values also shifts the time delay distribution to lower val- 
ues; the shift in the median is 17% for ST and 25% for 
PS. 

4.4. Variation in source distribution 

Our fiducial model assumes that all sources lie at red- 
shift Zg = 1.27, but we consider several other possibili- 
ties as follows. First, we consider Gaussian distributions 
with the same mean but different widths az^ ■ Second, we 
consider using the source redshift distribution from the 
2nd Caltech/Jodrell Bank VLBI sample (Henstock et al. 
1997). This distribution is reasonably well described by 
a Gaussian with mean (zg) = 1.33 and width az^ = 0.75, 
but we use the actual observed distribution. This is the 
same redshift distribution used by Sarbu, Rusin, & Ma 
(2001) in their study of the effects of dark energy on 
strong lensing. 

Figure 2 shows that with the mean source redshift 
fixed, increasing the scatter even to az, ^ 0.4 has little 
effect on the optical depth or image separation distribu- 
tion. Only when the scatter reaches az^ ~ 0.6 is there 
a notable increase, which is due to the growing number 
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of sources at higher redshifts (where the optical depth 
is higher). The higher optical depth for the model with 
the VLBI redshift distribution (~40% higher than for the 
fiducial model) is due partly to the higher mean redshift 
and partly to the large width of the distribution. Not 
surprisingly, broadening the source redshift distribution 
causes the lens redshift distribution to show a larger tail 
to higher redshifts. 

4.5. Variation in Cf, and a\ogc 

The parameters c* and a\og c control the distribution of 
concentration parameters for NFW halos. They have lit- 
tle effect on arcsecond-scale lenses because those arc pro- 
duced by galaxies with roughly isothermal profiles. But 
they have an enormous effect on the abundance of wide- 
separations lenses produced by group and cluster halos. 
Increasing the median concentration c, by 0.14 dex (the 
la scatter found by Bullock et al. 2001) increases the 
abundance of wide-separation lenses by more than a fac- 
tor of 8, while decreasing c* by 0.14 dex decreases the 
abundance by a factor of ~15. Even more extreme is 
the effect of changing the scatter in the c(M, z) rela- 
tion. Neglecting the scatter (using (Tiogc = 0) reduces 
the abundance of wide-separation lenses by more than 
two(!) orders of magnitude. Both effects are explained 
by the fact that making a halo less concentrated can dra- 
matically reduce its lensing cross section and its ability 
to produce lenses of a given size. This, of course, was 
the original reason for using NFW halos in lens statis- 



tics: halos that are less concentrated than SIS are needed 
to explain the lack of observed wide-separation lenses. 

It is important to understand this result in more detail. 
In Figure 5 we show how varying the concentration (for 
fixed mass and redshift) affects the lensing properties of 
an NFW halo. Panel (b) shows a log- normal distribution 
of concentrations with width a\ogc = 0-14 dex. Panel (c) 
shows the corresponding distribution of the lensing effi- 
ciency parameter /Xg. When the mass is fixed, varying 
the concentration changes Vg and pa, and hence changes 
the lensing strength parameter jis ■ Because of the steep 
dependence on /i^ (panel a) , small changes in the concen- 
tration can produce large changes in the Einstein radius 
and radial caustic. As a result, a 0.14 dex scatter in con- 
centrations leads to a spread of some five orders of mag- 
nitude in the scale radii (panels d and e), or ten orders 
of magnitude in the lensing cross section (c^pw cx: R'^^). 

In Figure 6 we have plotted the lensing cross section 
fT/,(logio c), the Gaussian distribution of the concentra- 
tion $(logioc), and their product. All curves are nor- 
malized to unity at c = Cmcd- The increase in the 
lensing cross section with concentration outweighs the 
decrease in the Gaussian tail out to 4 standard devia- 
tions. A peak occurs at around logio('^/<^med)/o'iogc = 
2, where the product of cti, and $ is about two or- 
ders of magnitude larger than at Cmed- The weighted 
cross section is given by the integral over this product 
(/ (Tl$ dlogc/ J^dlogc), which explains the difference 
between a single NFW halo at c = Cmed and a distribu- 
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tion of NFW halos around this concentration with scatter 
equal to aiogc- 

Put another way, even a massive halo with the me- 
dian concentration has a relatively small Einstein radius 
and cross section, so if all halos have the median con- 
centration ((Tiogc = 0) then wide-separation lenses will 
be quite rare. Only the most concentrated halos have 
reasonable cross sections for producing wide-separation 
lenses. Thus, wide-separation lenses are expected to be 
abundant only if the median and/or scatter in the con- 
centration distribution is large enough to produce a siz- 
able population of massive, concentrated halos. We con- 
clude that it is imperative to include the distribution 
of concentrations in lens statistics calculations — and 
that the main limitation in predictions of wide-separation 
lenses will be uncertainties in and CTiogc- These ef- 
fects have been mentioned before (e.g., Keeton & Madau 
2001: Wyithe, Turner, & Spcrgcl 2001), but we were not 
previously aware that they could affect NFW lensing so 
dramatically. 

4.6. Variation in and 

The upper transition mass Mc parametrizes the change 
from galaxy-sized SIS to group- and cluster-sized NF^^' 
lenses, and its effect on the image separation distribu- 
tion is orthogonal to the effects of the other parame- 
ters. Specifically, whereas the main effects discussed so 
far are increases or decreases in the abundance of wide- 
separation lenses, what Mc does is determine the po- 



sition of the kink in the image separation distribution 
that marks the transition from normal to wide-separation 
lenses. For our fiducial value log Mc = 13.50 the transi- 
tion occurs at « 4", while for log Mc = 13.25 it shifts 
down to K. 3" and for log Mc = 13.75 it shifts up to 
A6' « 6". The lack of lenses with separations larger than 
3" in the CLASS statistical sample makes it possible to 
place upper Hmits on Mc (see §5). 

The lower transition mass M^ parametrizes the change 
from dwarf-sized NFW to galaxy-sized SIS lenses. Mak- 
ing Md non-zero causes a drop in the abundance of lenses 
(in direct analogy with the galaxy to cluster transition 
at Mc). But the effects arc not rc;a{lily apparent in the 
image separation distribution, because they occur only 
at the smallest image separations. Except for very large 
values logMd 12, the effect is confined to M < O'.'l. 
Because today's largest statistically complete lens sam- 
ple is complete only above 0'.'3, the lower transition mass 
Md is not well constrained by the available data (see §5 
for details). For this reason we have chosen to omit the 
lower transition from our fiducial model. 

Even more interesting are the effects of the transition 
masses on the time delay distribution. Increasing Mc ef- 
fectively increases the abundance of lenses with interme- 
diate separations A^ ^ 4" -6", which have long time de- 
lays; so it shifts the time delay distribution to higher val- 
ues. Applying the Md transition reduces the abundance 
of lenses with small separations, so it removes many of 
the lenses with short time delays and hence changes the 
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Fig. 5. — The change in the lensing properties of an NFW 
halo with concentration, (a) The strong dependence of the Ein- 
stein radius ilein and the position of the radial caustic -Rrc on the 
dimensionless lensing efficiency parameter /is- (b) A log- normal 
distribution of concentrations with width criogc = 0.14 dex. The 
histogram shows the distribution of values drawn from the parent 
distribution represented by the smooth curve, (c-e) The resulting 
distributions of the efficiency parameter /i^, the Einstein radius 
i?eini a-nd the radial caustic -Rcr- The vertical lines indicate the 
values corresponding to the median concentration. 
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Fig. 6. — t7L, the lensing cross section (dashed), a Gaussian 
distribution in logiQ(c) with width criogc = 0.14 dex (dotted), and 
the product of the two (solid), for the same lens-source system as 
in Figure 5. All curves have been normalized to unity at c = Cmed- 
Since (T_l is a steeply rising function, the product of peaks at 
around two standard deviations about the median, with a weighted 
cross section about two orders of magnitude larger than at Cmed- 



mology and the profiles of massive dark matter halos. 
Our results suggest that the answer is yes, because of in- 
formation contained in the distribution of lensed image 
separations (also see Huterer & Ma 2003). The abun- 
dance of wide-separation lenses depends primarily on as 
and the core properities of NFW halos (the distribution 
of concentrations; or equivalcntly, in generalized NFW 
models, the distribution of inner cusp slopes). The ex- 
act location of the break at A9 ^ 4" between common 
arcsecond-scale lenses and less abundant wide-separation 
lenses probes baryonic cooling processes that determine 
why galaxies and clusters have different density profiles. 
The time delay distribution depends on these processes 
as well. Finally, the efi^ects of w are seen at all image sep- 
arations. Future data should therefore be able to break 
degeneracies between parameters by considering not only 
the total number of lenses but also the separation distri- 
bution. 

5. COMPARISONS WITH THE JVAS/CLASS SURVEY 

Already we can place constraints on some of the pa- 
rameters using existing lens samples. The largest homo- 
geneous sample comes from the combination of the Jo- 
drell Bank/VLA Astrometric Survey (JVAS) and Cosmic 
Lens All-Sky Survey (CLASS), which in its entirety com- 
prises VLA observations of 16503 radio sources with 22 
multiply- imaged systems (Browne ct al. 2002). A subset 
containing 8958 sources and 13 lenses forms a statistically 
complete sample with well-defined selection criteria (My- 
ers et al. 2002). The survey is believed to be complete 
at A^^ > 0'.'3; the observed image separations range from 
0'.'33 to 4'.'55 in the fuU sample, and 0'.'33 to 2'.'57 in the 
statistical subsample. The source redshift distribution is 
not well known, but the mean redshift of a subsample of 
sources is (zs) = 1.27 (Marlow et al. 2000). 

In the range of angular separations probed by 
JVAS/CLASS the transition masses Md and Mc are 
the most sensitive parameters. Variations in c* and 
CTiogc have larger effects overall, but only in the wide- 
separation tail where the null results of the JVAS/CLASS 
survey precludes interesting constraints. While Huterer 
& Ma (2003) claim to be able to distinguish different val- 
ues of w and as using the CLASS statistical sample, we 
find that the effects of those parameters are smaller than 
the Poisson uncertainties. We therefore restrict attention 
to Md and Mc- 

For a statistical analysis of the shape of the image sep- 
aration distribution, it seems reasonable to use all 22 
lenses. The incompleteness of the full sample should not 
affect the normalized separation distribution. We quan- 
tify the agreement between our models and the data us- 
ing the Kolmogorov-Smirnov (K.S.) test (e.g., Press et 
al. 1992, §14.3). The K.S. statistic is 



^KS 



max |Pobs(<A0)-P^od(<A0)|, (23) 

0<A9<oo 



shape of the time delay distribution. Of all the param- 
eter variations we consider, Mc and Md have the most 
significant effects on the time delay distribution. 

4.7. General comments 

In previous analyses it was not clear whether lens 
statistics could simultaneously constrain both the cos- 



where Pobs(< A0) and Pi„od(< Aff) are the observed and 
predicted cumulative image separation distributions. As- 
sociated with Dks is the K.S. probability Pks(-D), quan- 
tifying the significance of an observed value of f as a 
disproof of the hypothesis that the data and the model 
represent the same distribution. Large values of Dks cor- 
respond to small values of Pks and indicate poor agree- 
ment between data and model. 
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Fig. 7.— Comparison with JVAS/CLASS by K.S. test. The 
histogram on the left shows the normaUzed cumulative image sep- 
aration distribution for all 22 lenses in the JVAS/CLASS sample. 
The solid line shows our best-fit model, determined by choosing 
and Mc to minimize the K.S. measure Dmax- The dotted vertical 
Une indicates the JVAS/CLASS completeness limit, = 0''3. 
The right panel shows contours of i^g with the maximum at 
Pks = 0.887 marked by the cross. 
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Fig. 8.— Comparison with JVAS/CLASS data by test. The 
histogram on the left shows the unnormalized differential image 
separation distribution for the 13 lenses in the statistically com- 
plete JVAS/CLASS subsamplc. The error bars represent Pois- 
son noise. The solid line shows our best-fit model, determined 
by choosing M^j and Mc to minimize . The dotted vertical line 
indicates the JVAS/CLASS completeness limit for A0. The right 
panel shows contours of with the minimum marked by the cross. 



Figure 7 shows contours of D^g versus the transition 
masses Md and Mc- The best fit (minimum value of 
Dks) occurs at logM^ = 12.38 and logM^ = 13.42, 
and has a probabihty Pks = 88.7%. This value of the 
galaxy/cluster transition mass agrees well with the value 
found by Huterer & Ma (2003) from a similar analysis 
of the CLASS sample. The large inferred value of Md is 
surprising; if taken at face value, it implies that even a 
relatively massive galaxy like the Milky Way is a "dwarf" 
whose density profile is NFW. It is not clear whether this 
result reflects a shortcoming of our theoretical modeling, 
or an unidentified selection eff'ect in the JVAS/CLASS 
sample, or some other effect. 

The K.S. test applies only to the shape of the im- 
age separation distribution and is insensitive to the total 
lensing probability. As a complementary approach, we 
use a test to compare the unnormalized differential 
distributions; this test is somewhat less sensitive to the 
shapes of the distributions, but very sensitive to the rel- 
ative normalizations. We take care to use only the sta- 
tistically complete JVAS/CLASS subsample. We bin the 
13 lenses into seven bins of width log = 0.2; although 
the two right-most bins are empty, we include them be- 
cause our models can predict non-negligible numbers of 
lenses in this regime. 

Figure 8 shows contours of versus the transition 
masses. The best-fit model has = 4.39 for seven 
bins. We obtain good constraints on the upper tran- 
sition mass Mc, with a best-fit value of log Mc = 13.46 
in good agreement with the results of the K.S. test. For 
Md, by contrast, the surface fiattens into a long valley 
at log Md < 12. We are therefore unable to place a strong 
constraint on Md- At 68% confidence level we can only 
place an upper bound of logM^; < 12.57. As mentioned 
in §4.6, the effect of lowering Md below 10^^ Mq is con- 
fined to very small angular separations logA^ < —0.5, 
outside of the range probed by JVAS/CLASS. Future 
surveys extending to smaller image separations will be 
needed to robustly constrain Md- 

We note that although our models produce an accept- 
able value of x^ ! they cannot account for the sharp peak 
in the observed distribution at log ~ 0'.'2. The pres- 
ence of this peak actually explains why the K.S. test pro- 
duces such a large best-fit value for M^: being sensitive 
only to the shape of the separation distribution, the K.S. 



test prefers to raise Md to cause a rapid fall-off below 
the observed peak. Doing so sharply reduces the total 
lensing probability, however, and thus is disallowed by 
the x^ test. At this time it is not clear how to interpret 
these results, or even how to interpret the sharp peak in 
the observed distribution. Given that the sample size is 
still small and that even the high bin contains just seven 
lenses, it could just be due to a statistical fluctuation. 

Nevertheless, the fact that we can reproduce the data 
so well without flne-tuning seems to validate our fiducial 
model. 

6. FORECASTS 

Although the JVAS/CLASS sample is small, ongoing 
and future surveys should dramatically increase the size 
of statistically complete lens samples and make it pos- 
sible to constrain the model parameters. Wc can use 
our model to forecast the number of lenses that should 
be found in various surveys, and to understand when 
and how it may be possible to constrain the various 
model parameters. The most relevant surveys are deep, 
wide-field optical imaging surveys. The sources we con- 
sider are quasars, because they are common, distant, and 
straightforward to identify on the basis of optical colors 
(e.g.. Groom et al. 2001: Richards et al. 2002). See Holz 
(2001), Goobar et al. (2002), and Oguri, Suto, & Turner 
(2003a) for estimates of the number of lenses in other 
populations, such as supernovae. 

To compute the expected number of lensed quasars we 
must combine our calculated lensing probability with a 
description of the quasar population: 

iVlens = JdzJdS Piens(^, S) , (24) 

where dNq/dzdS is the distribution of quasars in red- 
shift and flux, and fiens is the lensing probability. We 
model the quasar population using the quasar lumi- 
nosity function derived from the 2dF quasar rcdshift 
survey. Boyle et al. (2000) find that the luminosity 
function can be well described as a double power law, 
$ (X [{L/UY + {L/L^f]-'^, where the power law in- 
dices are a = 3.41 and /3 = 1.58 and the break mag- 
nitude evolves as M*(z) = M*(0) - 3.4z + 0.68^^ for 
AGDM . While the 2dF quasar survey extends to red- 
shift z = 2.3, we extrapolate the luminosity function 
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Fig. 9. — Predicted number counts of lensed quasars as a function 
of the survey limiting magnitude, for the fiducial model. The top 
curve depicts all lenses, and the remaining ones show lenses with 
image separations AS > 0.3, 1, 2, 3, 4, 6, 10 arcsec (from top to 
bottom). The line in the upper left corner shows how the survey 
exposure time scales with limiting magnitude. 



to 2: ~ 5. Deeper quasar surveys indicate a bright end 
slope a»2.5atz^4. suggesting that the pure huni- 
nosity evolution model may break down at high redshift 
(Fan et al. 2001). This should not significantly affect 
our results, however, because even at faint magnitudes 
no more than ~10% of the predicted lenses come from 
quasars at z > 4. To convert rest frame B-band luminosi- 
ties to apparent magnitudes in various observed bands, 
we compute colors and X-corrections by convolving filter 
transmission curves with the composite quasar spectrum 
given by Vanden Berk et al. (2001). In the integral to 
compute the lensing magnification bias (sec eqs. 6 and 
14) we use the magnification of the fainter image, which 
is appropriate when we want to identify lenses where both 
images are above the flux limit. 

Figure 9 shows our fiducial predictions for the number 
of lensed quasars per square degree, as a function of the 
R-band limiting magnitude of a survey, for a variety of 
cuts on the image separation. We have also computed the 
number counts for the V and I passbands. We find that 
these curves are very similar to those in Figure 9 but 
simply offset horizontally by approximately ±0.4 mag; 
thus, we can use a single set of curves to determine the 
number counts in any of the three passbands. 

An important qualitative result is that the lens number 
counts increase less rapidly than the exposure time as the 
depth increases. Making a survey 1 mag deeper increases 
the exposure time by a factor of 6.3 but only increases 
the number of lenses by a factor of ~1.7-3.3 (depending 
on the original depth). By contrast, keeping the same 
depth but increasing the area by a factor of 6.3 would 
increase the number of lenses by the same factor of 6.3. 
If the main goal of a survey is to find as many lenses as 
possible in a fixed amount of telescope time, then a wide 
area is more valuable than a deep limiting magnitude. 

Figure 10 shows the predicted redshift distribution for 
lensed quasars. The distribution peaks at z ^ 2.5; it 
declines at low redshift mainly because the lensing op- 
tical depth becomes small, and at high redshift because 



Fig. 10. — Predicted redshift distribution of lensed quasars. 
The different curves correspond to different R-band limiting mag- 
nitudes. 



quasars become rare. The distribution shifts to slightly 

higher redshifts and becomes slightly broader at fainter 
limiting magnitudes, but the change is not large. Even 
at faint limiting magnitudes no more than 10% of the 
predicted lensed quasars have redshifts z > 4, so we be- 
lieve that uncertainty in the quasar luminosity function 
at high redshift (whether the pure luminosity evolution 
model is viable at z > 4) does not significantly affect our 
results. 

Finally, Table 3 gives predictions for the number of 
lensed quasars in current and future imaging surveys, 
obtained by combining our predicted number counts 
with the surveys' limiting magnitudes and areas. The 
systematic uncertainties are probably 10 20% for nor- 
mal arcsecond-scale lenses, but much larger for wide- 
separation lenses (see §4). At present about 80 lenses 
are known, but the largest statistical sample is 13 radio 
lenses from CLASS. The ongoing, ground-based NO AO 
Deep Wide-Field Survey (Jannuzi & Dey 1999) and the 
Deep Lens Survey^ could each produce a sample as large 
as CLASS, or even larger if lenses smaller than 1" can be 
identified. The Sloan Digital Sky Survey by itself should 
more than double the number of lenses; several new SDSS 
lenses have already been discovered (e.g., Inada et al. 
2003a, 2003b; Johnston et al. 2003). The proposed Su- 
pernova/Acceleration Probe (SNAP; Kim et al. 2002) 
satellite should discover a thousand or more lenses, and 
measure time delays for the several hundred discovered 
in the "deep" monitoring mode. The real breakthrough 
could come from the Large Synoptic Survey Telescope 
(Tyson et al. 2002), which would monitor 3/4 of the 
sky to reasonable depths and could discover some 10,000 
lenses. 

The search for lenses in these surveys will exploit the 
multiple colors that they all contain. Color cuts can be 
used to identify quasar candidates (e.g., Croom et al. 
2001; Richards et al. 2002). Lens candidates can then be 
selected either as pairs of quasars separated by a few 
arcseconds, or as objects that have quasar colors but 

® See http://dls.bell-labs.com. 
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Table 3. Survey Predictions 



Survey 


Area 




Depth 




Lcnscd C 


Quasars 






(□°) 






all 


>1" 


>3" 


>6" 


NOAO DWFS" 


18 


R 


= 25.8 


26 


11 


2 


0.03 


DLS*- 


28 


R 


» 26 


45 


19 


4 


0.1 


SDSS= 


10,000 


i* 


= 20.2= 


179 


73 


15 


0.5 


SNAP Deep<^ 


15 


I ' 


- 30 


269 


110 


22 


0.3 


SNAP Wide"^ 


300 


I ' 


^- 28 


1,820 


747 


147 


2 


LSST*" 


30.000 


R 


~ 21 


11.700 


6.010 


1.160 


18 



Note. — "NOAO Deep Widc-Ficid Survey (Jannuzi & Dey 
1999); sec http://www.noao.edu/noao/noaodeep. 
''Deep Lens Survey; see http : //dls .bell-labs . com. 
"^Sloan Digital Sky Survey (e.g., Abazajian et al. 2003); see 
http://www.sdds.org. The SDSS magnitude limit is i* = 19.1 
for quasars at redshifts z < 3 and i* = 20.2 for quasars at z > 3 
(Riehards et al. 2002). 

''Supernova/Acceleration Probe; see http: //snap . Ibl . gov. Cur- 
rent mission parameters inelude a "deep" mode writh deep monitor- 
ing of 15 and a "wide" mode with shallower imaging of 300 
(Kim et al. 2002). 

■^Large-aperture Synoptic Survey Telescope (Tyson et al. 2002); see 
http : / /www . Isst . org. 



are marginally resolved (i.e., small-separation lenses in 
ground-based surveys). Such lens search techniques are 
being refined for large surveys by SDSS (Inada et al. 
2003a; Pindor et al. 2003). 

Spectroscopic confirmation of faint lens candidates 
may be difficult at present, although not with future 
30-metcr class telescope. Still, other confirmation meth- 
ods are available. First, candidates with four (or more) 
lensed images can often be confirmed by the image con- 
figurations alone, and this should account for at least 25- 
30% and perhaps as many as 60% of the lenses (based 
on currc;iit predicted and observed 4-image to 2-image 
lens ratios; Keeton, Kochanek, & Seljak 1997; Rusin 
& Tegmark 2001). Second, surveys that include near- 
infrared imaging may discover infrared Einstein rings, 
which are lensed images of the quasar host galaxies 
that provide unambiguous evidence for lensing (e.g., 
Kochanek, Keeton, & McLeod 2001). Finally, monitor- 
ing programs like SNAP and LSST would offer the first 
chance to confirm lenses via time delays. 

7. CONCLUSIONS 

The statistics of strong gravitational lenses depend on 
a number of parameters related to both cosmology and 
the internal structure of the lenses. We have developed 



the full formalism for computing lens statistics in uni- 
verses dominated by quintessence. We have also shown 
=how variations in the parameters lead to changes in the 
distribution of image separations, time delays, and lens 
-redshifts. The strongest effects are found in the image 
separation distribution. Most lenses have image separa- 
tions of around one arcsccond, corresponding to lensing 
by individual galaxies. In this regime, the most impor- 
tant parameters are w and erg) which produce changes in 
the number of lenses at the tens of percent level. These 
effects are too small to be probed with existing lens sur- 
veys like CLASS (but see Hutercr & Ma 2003), but will 
not be out of reach of the large samples that will be found 
with ongoing and future surveys. 

The most dramatic parameter dependences are found 
among wide-separation lenses produced by groups and 
clusters of galaxies. The abundance of these lenses is ex- 
tremely sensitive to the distribution of "concentration" 
parameters that describe the inner structure of massive 
dark matter halos, and somewhat less sensitive to the 
abundance of such halos. Wide-separation lenses will 
always be rare, but the remarkable sensitivity to the 
parameters means that even a small number of lenses 
with separations greater than ^ 5" should yield inter- 
esting constraints. Indeed, the first instance of a wide- 
separation lens in a statistical sample has recently been 
discovered and is being analyzed for constraints on erg 
and the core structure of cluster halos (Inada et al. 2003b; 
Oguri et al. 2003b). 

Imaging surveys that are already underway will sub- 
stantially increase the sample of strong lenses, and fu- 
ture surveys promise even more. Predicted samples of 
hundreds or thousands of lenses will revolutionize lens 
statistics, provided that their selection effects are well 
understood. A robust measurement of the distribution 
of image separations will yield internally self-consistent 
tests of cosmological parameters, the dark matter den- 
sity profiles predicted by the popular cold dark matter, 
and the physics of baryonic cooling in massive halos. 
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APPENDIX 

POWER SPECTRUM NORMALIZATION 

We have normalized our power spectrum to match the observed abundance of galaxy clusters today. The largest 
and most recent X-ray flux-limited sample has been assembled by Reiprich & Bohringer (2002) and is known as 
HIFLUGCS. The cluster sample is based on the ROSAT All-Sky Survey and is composed of the tightest 63 clusters 
with galactic latitude > 20° and flux ./■,y(0.1 - 2.4 keV) > 2 x 10"" ergs s^^ cm^^ 

From the published data (Reipric;h & Bohringer 2002) we have taken the cluster redshift (z), the flux in the energy 
range 0.1 — 2.4 keV (/x), and Af2()o. the mass enclosed in a region in which the; average mass density is 200 times the 
critical density, and calculated an X-ray cluster mass function, analogous to the work by Reiprich & Bohringer (2002). 
The mass function was determined using the classical Vmax method: 
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Fig. All.— ACDM HIFLUGCS data and best-fit (cts = 0.74) Jenkins et al. (2001) mass function. 



where AA^f denotes the width of the mass bin, and Vmax.i = ^max(ix,i) is the maximum comoving volume within 

which the i^^ cluster with luminosity Lx,i could have been detected given the flux limit of the sample. Reiprich & 
Bohringer (2002) found a correlation between the total mass of the clusters and their X-ray luminosity, but with a 
significant scatter. An advantage of using Vmax{Lx) in the calculation of the mass function is that this scatter is 
automatically taken into account. 

There is a slight cosmology dependence of the mass function calculated this way, stemming from the use of the 
luminosity distance in converting fx to Lx, but the effect is never larger than 5% and always smaller than the Poisson 
noise due to the limited number of clusters per bin. 

The last step in the cluster normalization of the power spectrum is the matching of a theoretical mass function. 
As explained in § 3.5 we have chosen the Jenkins et al. (2001) mass function, which depends on the power spectrum 
through (Tm, sec eqs. (20) and (22c). Wc simply performed a minimization to find the value of cts that produced the 
best-fit between the mass function and the HIFLUGCS X-ray cluster mass function. The w; = — 1 mass function and 
the best- fit mass function are shown in Figure All. The w-dependence enters this method in two minor ways: the first 
is the aforementioned cosmology dependence of Vmax, and the second arises from the transfer function. Neither of these 
two effects is very significant, however, and so it is no surprise that the resulting cts's are identical within uncertainties. 
We find erg = 0.738, 0.740, and 0.742 for w = —1, —0.75, and — 0.5, respectively. The value o-g = 0.74 is consistent with 
what Reiprich & Bohringer (2002) found, who performed a more extensive analysis and found erg = OASflJ^'^^, which 
evaluates to as = 0.68 for Qm = 0.3. Note, however, that their full analysis prefers Qm = 0.12tQ Q4 and erg = 0.96to!i2- 
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